Qifeng Chen

FastVMT: Eliminating Redundancy in Video Motion Transfer

Feb 05, 2026
Via arXiv

Show, Don't Tell: Morphing Latent Reasoning into Image Generation

Feb 02, 2026
Via arXiv

HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

Feb 02, 2026
Via arXiv

FlyAware: Inertia-Aware Aerial Manipulation via Vision-Based Estimation and Post-Grasp Adaptation

Jan 30, 2026
Via arXiv

TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment

Jan 27, 2026
Via arXiv

Active Intelligence in Video Avatars via Closed-loop World Modeling

Dec 23, 2025
Via arXiv

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Dec 23, 2025
Via arXiv

Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations

Dec 22, 2025
Via arXiv

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Dec 19, 2025
Via arXiv

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

Dec 18, 2025
Via arXiv